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DEAD-box proteins are superfamily 2 helicases that function 
in all aspects of RNA metabolism. They employ ATP binding 
and hydrolysis to generate tight, yet regulated RNA binding, 
which is used to unwind short RNA helices non-processively 
and promote structural transitions of RNA and RNA-protein 
substrates. In the last few years, substantial progress has been 
made toward a detailed, quantitative understanding of the 
structural and biochemical properties of DEAD-box proteins. 
Concurrently, progress has been made toward a physical 
understanding of the RNA rearrangements and folding steps 
that are accelerated by DEAD-box proteins in model systems. 
Here, we review the recent progress on both of these fronts, 
focusing on the mitochondrial DEAD-box proteins Mss116 and 
CYT-19 and their mechanisms in promoting the splicing of 
group I and group II introns. 



Introduction 

DEAD-box proteins make up the largest group of nucleic acid 
helicases. 1 They comprise a family within helicase superfamily 2 
and function in essentially all facets of cellular RNA metabolism, 
including transcription, mRNA and tRNA processing, protein 
synthesis, RNA nuclear export and RNA degradation. 2,3 Many 
of these processes involve RNAs with defined structures, 4 " 8 and 
the most common role of DEAD-box proteins is to use cycles 
of ATP binding and hydrolysis to promote conformational rear- 
rangements of these RNAs during their biogenesis or in the 
course of their cellular functions. 9,10 Some DEAD-box proteins 
are directed to individual RNA targets, whereas others function 
more broadly as general chaperones in RNA folding and RNA- 
protein assembly processes," in line with an early suggestion that 
the fundamental properties of RNA are likely to lead to a general 
requirement for chaperones in RNA folding. 12 
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Along with the advances in understanding the physiological 
roles of DEAD-box proteins, much has been learned about their 
biochemical properties. Their most fundamental property is that 
they possess ATPase activity that is stimulated by RNA. 13 Another 
feature that appears to be universal is that the ATPase cycle is 
linked to a cycle of changes in affinity for RNA. 14 These basic 
properties are used in different ways by different DEAD-box pro- 
teins for a broad range of biochemical activities and physiological 
roles. DEAD-box proteins commonly have the ability to use ATP 
to unwind short RNA duplexes, as well as additional activities such 
as disruption or prevention of RNA-protein interactions 15 " 17 and 
acceleration of RNA helix formation (RNA strand annealing). 18 " 21 

Determining which properties are general for all DEAD-box 
proteins and which are specific to individual ones is critical for 
developing a molecular understanding of the functions of these 
remarkable enzymes. Toward this objective, here we first outline 
our current understanding of the biochemical properties and 
activities of DEAD-box proteins, focusing on recent develop- 
ments in this dynamic field. We then describe how these activi- 
ties are understood to underlie the ability of DEAD-box proteins 
to promote RNA structural rearrangements, with an emphasis on 
the yeast mitochondrial (mt) protein Mssll6 and its Neurospora 
crassa ortholog CYT-19. These proteins function as general RNA 
chaperones for the folding of mt group I and group II introns 
and participate in translational activation and other RNA pro- 
cessing reactions. 22 " 24 They have been the focus of intense bio- 
chemical and biophysical dissection in recent years and represent 
well-defined systems for exploring the relationship between the 
biochemical properties and physiological roles of DEAD-box 
proteins. We include results delineating biochemical properties 
of other DEAD-box proteins both to emphasize points of gen- 
erality and to highlight important differences between different 
DEAD-box proteins. 

Structural Characteristics and Biochemical Activities 
of DEAD-box Proteins 

Crystallographic analyses have revealed structural features that 
are universal to DEAD-box proteins. 25 " 35 Like other superfamily 
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Figure 1. Core domain structure of DEAD-box proteins. The top panel shows a stereoview of Mss116 bound to AMP-PNP and ssRNA (U m ). 31 Motifs that 
interact with ATP are orange and motifs that interact with ssRNA are green. Motif III, which does not contact ATP or ssRNA but is involved with commu- 
nication between the two domains and with substrates, is olive. Dl is dark gray, DM is silver, the linker between them is dark blue and the CTE, present 
in Mss116 and CYT-19, is light blue. Below the structure is a cylinder representation showing the domains and motifs in the same colors. Also shown are 
the N-terminal extension (NTE) and the C-tail of Mss116, which were not present or not resolved in the crystal structures (refs. 31 and 41). 



1 and 2 helicases, they possess a conserved core consisting of two 
RecA-like domains that are connected by a flexible linker. At least 
13 conserved motifs lie within these domains and contribute to 
interactions between the two domains or with substrates (Fig. 1). 
Crystal structures of ternary complexes of DEAD-box proteins 
with a bound adenosine nucleotide analog and small ssRNA have 
shown that ATP binds to a site that is largely contained within 
domain 1 but also includes amino acid residues from domain 2. 
Within domain 1, the triphosphate moiety interacts with motif 
I, which includes a sequence known as the P-loop or Walker A 
motif, and motif II, which includes the eponymous D-E-A-D 
sequence. In addition, the base-pairing face of the adenine nucle- 
otide forms specific interactions with a glutamine of the Q-motif, 
resulting in a strong preference in DEAD-box proteins for ATP 
over other nucleoside triphosphates. 36,37 Contacts with domain 

2 include interactions of the triphosphate with motif VI and the 
ribose sugar with motif Va. Motif III lies near the 7-phosphoryl 
group of ATP at the interface of the two domains. On the oppo- 
site side of the protein, single-stranded RNA (ssRNA) also binds 
across the domain interface to a site that includes contacts with 
motifs la, lb and Ic from domain 1, along with motifs IV, IVa, V 
and Vb from domain 2. 

A universal feature of DEAD-box proteins is that bound 
ssRNA undergoes a sharp bend that is induced by ionic and 
H-bond interactions with a wedge helix containing motif Ic in 
domain 1. 28-35 The requirement for ssRNA to bind in this confor- 
mation, which is incompatible with helix formation, contributes 
to generating the local strand separation that is used for RNA 
unwinding. 38 " 40 Additionally, the structure of Mssll6 revealed a 



second bend in the bound ssRNA, resulting in RNA crimping. 31 
This second bend is caused primarily by a steric block between 
nucleotides near the 5' end of the bound ssRNA and a C-terminal 
extension (CTE) of domain 2. 41 

In addition to the conserved core, most DEAD-box proteins 
have extensions or ancillary domains, which differ for different 
DEAD-box proteins and can play a variety of roles. The ancil- 
lary domains of a number of DEAD-box proteins interact with 
RNA or protein components of substrates, thereby targeting 
individual DEAD-box proteins to their physiological targets. 9,10 
An archetypal protein of this type is the bacterial DbpA/YxiN 
protein, which has an ancillary C-terminal domain that binds 
specifically to a helix and adjacent structure within the large 
subunit rRNA. 42 ' 44 Ancillary domains can also mediate protein 
dimerization 45 or regulate the functional properties of the core, 34 
and in some related proteins they contribute additional enzy- 
matic activities. 46,47 In Mssll6 and CYT-19, domain 2 contains 
an ct-helical CTE of -100-amino-acids followed by a positively 
charged, 60-75 amino acid segment termed the C-tail (Fig. 
1). 31,48 The CTE stabilizes domain 2 and induces the second 
RNA bend noted above, resulting in RNA crimping, 31 while the 
C-tail contributes to the non-specific binding of RNA substrates 
(see below). 48,49 

Cooperative binding of RNA and adenosine nucleotide. At 

the heart of the ATP-dependent activities of DEAD-box proteins 
is a cycle of RNA affinity changes that is tightly linked to the 
ATPase cycle. The effect of these affinity changes is to gener- 
ate strong, yet regulated binding to a strand of RNA, displacing 
this segment from a partner strand and temporarily preventing 
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it from forming intramolecular or intermolecular contacts with 
other RNA segments or proteins components. 

Indeed, many DEAD-box proteins have been shown to bind 
cooperatively to ssRNA and adenosine nucleotides. Cooperativity 
was demonstrated for the mammalian eIF4A protein by steady- 
state ATPase activity 14 and for E. colt DbpA and other DEAD- 
box proteins by binding assays using nucleotide analogs such as 
AMP-PNP. 37 ' 50 " 53 A critical feature of the ATPase cycle is that the 
level of cooperativity changes dramatically at different stages, 
regulating the affinity for RNA. In general, positive cooperativity 
is associated with the presence of the 7-phosphoryl group of the 
adenosine nucleotide (Fig. 2). Thus, in the presence of ATP or 
an ATP analog, binding of RNA is typically tightened. However, 
a limitation of ATP analogs is that they do not mimic all of 
the binding properties of ATP, as most do not stimulate RNA 
unwinding, 39,40,54 so it is unclear whether the level of cooperativ- 
ity is the same as with ATP. In contrast, ADP binding is typically 
observed to give energetically independent binding of RNA 55 or 
anti-cooperativity. 14,37 

To understand and predict the functional properties of a given 
DEAD-box protein, it is necessary to have a detailed, quantita- 
tive understanding of the energetic coupling between nucleotide 
and RNA binding at each stage of the ATPase cycle. A key step 
was taken recently for Mssll6 (Fig. 2). 56 Fluorescence approaches 
with a truncated protein construct that includes the tandem 
RecA-like core domains and the CTE but lacks the C-tail and 
a short N-terminal extension (NTE) showed that ATP binding 
increases the affinity for a small double-stranded, hairpin RNA 
(2-7-fold). In contrast, RNA binding is negatively coupled with 
ADP binding (3-fold). Further experiments showed that RNA 
accelerates hydrolysis of ATP and suppresses its re-synthesis on 
the enzyme, such that there is a dramatic shift of the equilibrium 
toward products. This knowledge allows calculation of the affin- 
ity for RNA when the protein is bound to ADP and Pj (denoted 
the ADP-P. state), which indicates very tight RNA binding (140- 
500-fold tighter than with ATP). The RNA affinity cycle that 
emerges has important parallels with that derived from the same 
approaches for DbpA. 55 For both proteins, the tightest binding of 
RNA occurs in the ADP-P. state and is substantially weaker in 
the ATP and ADP-bound states. Unlike Mssll6, cooperativity 
with ATP and anti-cooperativity with ADP were not detected 
for DbpA. 

Key to understanding how DEAD-box proteins manipulate 
RNA structures is a molecular understanding of the conforma- 
tional changes that occur during the ATPase cycle and underlie 
the cooperativity between RNA and adenosine nucleotide. All 
current models link positive cooperativity to the closing of the 
two core domains. 9,10,29 In the absence of substrates, DEAD-box 
proteins populate conformational ensembles in which the two 
core domains are separated and have some independent mobility. 
This property was noted in the earliest crystal structures, 25,26 and 
more recently has been observed for DbpA in fluorescence reso- 
nance energy transfer (FRET) experiments 53 and for Mssll6 and 
CYT-19 by small angle X-ray scattering (SAXS). 57 Upon bind- 
ing RNA and ATP or an ATP analog, the domains pack against 
each other in a 'closed' conformation. 28 " 31,33 " 35 In this closed 




conformation, both ligands form contacts with both domains 
and thus the domain closure is highly likely to be the main source 
of cooperativity. 29 The increase in cooperativity in the transition 
from the ATP to the ADP-P. state most likely reflects rearrange- 
ments that occur upon ATP hydrolysis during or after core clo- 
sure and affect the affinity for the bound RNA. 

There are two key predictions from this model. First, ligand 
combinations that do not give cooperative binding should not give 
stable formation of the correct closed conformation. Consistent 
with this prediction, ADP and RNA binding to DbpA do not 
give a stable, closed conformation as viewed by FRET. 53 Second, 
the cooperativity of binding might be particularly sensitive to the 
domain interface. Indeed, mutations in motifs III and Va, both 
of which line the domain interface, have been observed to reduce 
the cooperativity without substantially weakening binding of the 
individual ligands. 58,59 

ATPase cycle kinetics. Knowledge of the relative and absolute 
rate constants for individual steps in the ATPase cycle is another 
key toward understanding how DEAD-box proteins carry out 
RNA remodeling reactions. This knowledge is necessary to 
understand which functional states of the DEAD-box protein 
will accumulate and to relate the time scales necessary for the 
DEAD-box protein actions to the subsequent RNA folding steps 
and rearrangements. While the absolute rate constants will dif- 
fer for different proteins and may depend on the specific RNA 
substrate and on solution conditions, there are likely to be general 
themes for the relative rate constants that are useful as a starting 
point. 

A general picture of the ATPase kinetics of DEAD-box pro- 
teins has emerged from extensive data sets for DbpA and Mssll6 
(Fig. 3), 55,56 along with previous work on eIF4A and the yeast 
protein Dedl, which functions in the nucleus and cytosol in 
processes including RNA splicing and translation. 60 ATP and 
RNA can bind in either order. 14 The affinity for ATP is typically 



Figure 2. RNA affinity changes resulting from changes in cooperativity 
during the ATPase cycle. In the absence of nucleotide (No nt), coop- 
erativity is absent by definition. Cooperativity is typically observed in 
the ATP-bound state and is maximal in the ADP-P. state, as shown by 
the thickening blue stripe. Anti-cooperativity in the ADP-bound state 
is shown by the gray stripe. Specific values of cooperativity for Mss116 
are indicated for each state. 56 The colors for the nucleotide states match 
those in Figure 3. 
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Figure 3. ATPase kinetics of DEAD box proteins. Specific rate and equilibrium 
constants are shown for Mss116 (data from ref. 56) The domain structures shown are 
based on SAXS studies of the open and closed complexes. 57 Nucleotides are abbrevi- 
ated by T (ATP), D-P. (ADP and P.) and D (ADP). The ADP-P. state is highlighted in red to 
indicate that it is populated to the greatest extent in the steady-state with saturating 
ATP and RNA. 



between 50 and 500 |jlM in the absence of RNA 14 ' 23 ' 55 ' 56 ' 61 and 
can vary across this range for different proteins and even for 
the same protein under different solution conditions. 14 In the 
absence of RNA, ATP hydrolysis is slow (< 0.3 sec" 1 ), such that 
the equilibrium constant for binding is expected to be equal to 
the K M measured by steady-state ATPase activity, and thus the 
affinity estimates above include K u values. Upon binding RNA, 
ATP hydrolysis is stimulated to 5-10 sec" 1 , generating an ADP-P. 
state that remains bound to RNA. As noted above, RNA affin- 
ity is maximized in this state. With bound RNA, ATP hydro- 
lysis is largely irreversible because P. release is faster than ATP 
re-synthesis, but release of P. is slow enough to be at least partially 
rate-limiting for the overall ATPase cycle (2—6 sec" 1 ). Therefore, 
under conditions of saturating ATP and RNA, the protein spends 
a significant fraction of the time in the tight-binding, ADP-P. 
state. 55,56 Upon release of P., the affinities for RNA and ADP are 
dramatically reduced. ADP is released rapidly (30-100 sec" 1 ), and 
RNA presumably is as well. At this point, the protein can re-bind 
weakly to RNA and can again bind ATP to begin another cycle. 
It may also re-bind ADP, which for some DEAD-box proteins 
binds substantially tighter than ATP. 14 ' 55 In vivo, at cellular con- 
centrations of ADP, the net exchange of ADP for ATP may be 
partially rate-limiting for the overall cycle for some DEAD-box 
proteins. 55 

For many DEAD-box proteins, another level of regulation 
in the ATPase kinetics is conferred by interactions with partner 
proteins. There is no evidence for such regulation for Mssll6 
or CYT-19, which can function by themselves as general RNA 
chaperones in vitro (see RNA remodeling functions of DEAD-box 
proteins below). On the other hand, a growing number of DEAD- 
box proteins are known to interact with proteins that modulate 
the rates of individual steps in the ATPase cycle and thereby 
influence the timing of assembly and disassembly with RNA. 



The eIF4A protein interacts with other components 
of the translation initiation machinery, including 
eIF4G and eIF4B, and these interactions acceler- 
ate the ATPase cycle. 62 " 66 Co-crystal structures and 
single molecule FRET experiments indicate that the 
binding of eIF4G promotes a conformation of the 
two core domains of eIF4A that is neither fully open 
nor fully closed, 64,67 presumably increasing the rates 
of nucleotide exchange and P. release. 64 Interestingly, 
Dedl also forms a complex with eIF4G, other pro- 
teins of the eIF4F complex, and mRNA, but the 
complex is arrested rather than accelerated through 
the ATPase cycle. 68 

For two other DEAD-box proteins, recent evi- 
dence indicates that interactions with partner pro- 
teins tightly regulate the linked RNA-binding and 
ATPase cycles and are essential for the biological 
roles of these proteins. The first is eIF4A-III, which 
functions as a clamp in the exon junction complex 
(EJC). 28,30,33,69 When bound to mRNA and the other 
protein components of the EJC, Magoh and Y14, 
eIF4A-III is arrested in the ADP-P. state, resulting in 
a very stable complex. 33 The protein PYM promotes 
dissociation of this complex, 70 but its effects on the ATPase kinet- 
ics of eIF4A-III have not been explored. 

Partner proteins are also critical for the ATPase cycle of Dbp5 
(DDX19 in humans), which functions at the cytoplasmic face of 
the nuclear pore to promote export of RNA. 17,71 " 73 Dbp5 interacts 
strongly with the nucleoporin Nupl59 (NUP214 in humans). In 
addition to localizing Dbp5 at the nuclear pore, this interaction 
accelerates ADP release by Dbp5, which is suggested to be inher- 
ently slow. 74 The Glel protein, which is also localized at the cyto- 
plasmic face of the nuclear pore, interacts strongly with Dbp5 
in the presence of the cofactor IP fi . Interestingly, Glel-IP 6 binds 
to the helicase core at the same site that eIF4G binds to eIF4A, 
and in the complex the two core domains of Dbp5 are separated 
and the RNA binding site is disrupted. 75 Thus, Glel-IP 6 bind- 
ing is strongly anti-cooperative with RNA binding, which may 
result in rapid release of Glel-IP 6 upon binding of RNA 74 and/ 
or acceleration of RNA release to allow recycling of Dbp5. 75 On 
the other hand, Glel-IP 6 binding is cooperative with nucleotide 
binding, perhaps contributing to the requirement for Nupl59 to 
accelerate ADP release. 74 

ATP utilization during RNA unwinding. A key goal for a 
mechanistic understanding of DEAD-box proteins is to under- 
stand how the ATPase cycle is coupled to local RNA unwinding. 
Although our understanding remains incomplete, considerable 
progress has been made in the last few years. It was shown for 
Dedl, Mssll6, eIF4A and DbpA/YxiN that complete unwind- 
ing of relatively short duplexes can be achieved in the presence 
of the analog ADP-BeF x . 40,54 This finding indicates that unwind- 
ing occurs without ATP hydrolysis and that ATP hydrolysis is 
required primarily to generate the weaker binding states that 
allow release of the bound RNA strand and recycling of the 
enzymes. 
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In a separate study, ATP hydrolysis and RNA unwinding were 
measured in parallel to determine the number of ATPs consumed 
during unwinding. 39 The implication from the finding above is 
that unwinding would require a single cycle of ATP binding and 
hydrolysis, and indeed a stoichiometry of one ATP per duplex 
unwound was observed for short duplexes under a range of condi- 
tions. Together, these results provide strong evidence for a simple 
coupling between the ATPase cycle and the RNA binding cycle 
and a mechanism of unwinding that does not involve multiple 
cycles with directional translocation. 38,76 This model is consistent 
with the known ability of most DEAD-box proteins to unwind 
short RNA helices but not helices longer than 10-15 bp. 18,77,78 It 
was also shown that more ATP is hydrolyzed during unwinding 
of longer or more stable helices, indicating that ATP hydrolysis 
does not always lead to complete unwinding of such duplexes and 
implying that some ATPase cycles are futile. 79 Results suggesting 
hydrolysis of a single ATP during unwinding of short RNA heli- 
ces have since been obtained for other DEAD-box proteins and 
one protein from another SF2 family, 56,80,81 suggesting that this 
general mechanism applies to a range of helicases. 

On a molecular level, the model that emerges is that binding 
of ATP and dsRNA can lead to a conformational change that 
generates local strand separation. For a short, relatively unstable 
duplex, this conformational change and resulting strand sepa- 
ration can be sufficient to generate complete unwinding, and 
the strand that is not tightly bound by the DEAD-box protein 
dissociates. The conformational change and strand separation 
would follow ATP and RNA binding and be slower than these 
processes, but it would be faster than ATP hydrolysis. The rate 
and extent of this conformational change could also depend 
on the stability of the RNA duplex, because this would influ- 
ence the energy barrier that must be overcome. As expected 
for nucleotide binding followed by a conformational change, 
multi-phase kinetics have indeed been observed in kinetics 
experiments monitoring nucleotide binding. 55,56,64 A recently 
published paper has provided detailed insight into the structural 
basis for RNA duplex recognition and unwinding by Mssll6 
and a comprehensive structural model for RNA unwinding by 
DEAD-box proteins. 82 

RNA interactions formed by regions outside the helicase 
core. In addition to interacting with dsRNA and ssRNA at the 
canonical RNA binding site within the core, some DEAD-box 
proteins use additional RNA interactions to localize the core to 
the physiological targets. Molecular features have been elucidated 
for two types of interactions. Mssll6 and CYT-19 possess at the 
extreme C-terminus a stretch of 60-75 amino acids termed the 
C-tail (Fig. 1). This sequence is hydrophilic, rich in arginine and 
predicted to be unstructured in solution. 48 The C-tails of both 
proteins are easily removed by limited proteolysis, suggesting 
that they are connected by a relatively accessible linker. 49,56 RNA 
unwinding by CYT-19 is enhanced by RNA structure adjacent to 
the helix substrate, regardless of whether the extension is ssRNA, 
dsRNA, dsDNA, or a higher-order structured RNA, 83 and this 
enhancement is dependent on the C-tail. 49 These results suggest 
that the C-tail contacts the adjacent RNA and tethers the core for 
RNA unwinding. 49,83 



A recent study using SAXS supports and extends this model. 57 
For both CYT-19 and Mssll6, either alone in solution or with 
the core bound to ADP-BeF and a short ssRNA, the scatter- 
ing profiles indicate that the C-tails occupy space adjacent to 
domain 2 and are largely unstructured. SAXS data from com- 
plexes with nucleic acid constructs that include extensions to the 
ssRNA showed that the C-tail is flexible and can move through 
a wide arc to interact with extensions at either end of the ssRNA 
(Fig. 4A). These results suggest that the C-tail provides a flexible 
tether, which binds non-specifically to large RNA substrates and 
allows the helicase core to act repeatedly on neighboring regions 
(Fig. 4B). 

A detailed understanding of the RNA targeting interactions 
has been achieved for E. coli DbpA and its B. subtilis ortholog 
YxiN. A C-terminal domain of this protein adopts the fold of 
an RNA recognition motif (RRM) and binds specifically to a 
hairpin structure within the 23S rRNA (helix 92), localizing 
DbpA for participation in biogenesis of the ribosomal large sub- 
unit. 42,84,85 A co-crystal of the RRM domain of YxiN and an 
RNA that included helix 92 and two additional helices showed 
that the RRM domain binds helix 92 and an adjacent junc- 
tion in a mode different from that observed previously for other 
RRMs. 44 The flexible linkage between the RRM domain and the 
helicase core 86 may permit the RRM domain to form a tethering 
interaction while the core interacts with nearby RNA structures, 
analogous to the tethering interaction of the C-tails of Mssll6 
and CYT-19 but in this case at a specific location within the 23S 
rRNA (Fig. 4C). 

RNA Remodeling Functions of DEAD-box Proteins 

DEAD-box proteins facilitate rearrangements of most of the 
known structured RNAs. These RNAs range from the relatively 
simple self-splicing introns to complex RNA-protein machines 
such as the ribosome and spliceosome. Three DEAD-box pro- 
teins, along with five proteins from other SF2 families, are 
required for the extensive rearrangements that occur during the 
formation and function of the spliceosome. 87,88 Analogously, 
several DEAD-box and related proteins are required for proper 
folding and assembly of the ribosome. There are excellent recent 
reviews on the roles of DEAD-box proteins in these complex 
machines, 6,89,90 and here we focus on simpler RNA systems, spe- 
cifically group I and group II introns. These RNAs have been 
studied for decades as models of RNA tertiary folding and cataly- 
sis, and they allow a broad range of approaches for probing the 
mechanisms of DEAD-box proteins in facilitating RNA folding 
processes. A key goal is to develop a molecular understanding 
that directly and quantitatively connects the biochemical proper- 
ties of DEAD-box proteins to their physiological roles in these 
processes. Although we have not yet achieved this goal, work in 
the last few years has increased our understanding of RNA fold- 
ing processes that are accelerated by DEAD-box proteins and the 
biochemical activities that are required in vitro and in vivo. 

Group I introns. Group I introns have been valuable model 
systems for studying the mechanisms of DEAD-box proteins 
as general RNA chaperones. 11 ' 91 These introns are autocatalytic 
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Figure 4. RNA interactions with regions of DEAD-box proteins outside the helicase core. (A) The C-tails of Mss116 and CYT-19 are unstructured in 
solution and are flexible, as revealed by SAXS experiments, 57 and are able to move relative to the core across a wide region of space. (B) Interactions of 
Mss116 and CYT-19 with structured RNA. The high flexibility of the C-tail is suggested to allow it to contact structured RNA and to tether the helicase 
core in proximity to structured RNA, while allowing the core to move to sample different regions of the RNA. The figure shows a model of the Tetrahy- 
mena group I intron 150 and a model from the solution and crystal structures of Mss116. 57 (C) A tethering interaction is formed by the C-terminal domain 
of DbpA/YxiN with a specific hairpin structure of a 23S rRNA precursor. Panels A and B are reprinted from ref. 57 and panel C is reprinted from ref. 44 
(Copyright 2010), with permission from Elsevier. 



RNAs that consist of a conserved catalytic core and various 
peripheral structural elements. 92,93 The catalytic core consists of 
two extended helical domains (the P4-P6 and P3-P9 domains), 93 ' 94 
and these domains interact to form the RNA's active site, which 
catalyzes a two-step splicing reaction. 95,96 The core is stabilized by 
interactions with and between the peripheral elements, and most 
group I introns are also stabilized by protein splicing factors that 
bind specifically to the intron RNA and are required for efficient 
splicing in vivo. 97 

In addition to splicing factors, the efficient splicing of sev- 
eral mt group I introns in N. crassa and all group I introns in S. 
cerevisiae requires the DEAD-box proteins CYT-19 and Mssll6, 
respectively. 22 " 24 In N. crassa, CYT-19 functions in group I intron 
splicing with the splicing factor CYT-18 (the mt tyrosyl-tRNA 
synthetase), 23 whereas in S. cerevisiae, Mssll6 functions in con- 
cert with a number of different splicing factors that act on differ- 
ent group I introns. 24 In contrast to splicing factors, which bind 
specifically to the intron RNAs and stabilize the active RNA 
structure, CYT-19 and Mssll6 bind group I intron and other 
RNAs non-specifically and function primarily to resolve stable 
inactive or intermediate structures ("kinetic traps") that limit 
the rate of RNA folding." As expected from their non-specific 
RNA binding, CYT-19 and Mssll6 can readily substitute for 
each other or be replaced by other DEAD-box proteins, such as 
Dedl, in group I intron splicing reactions in vitro or in vivo. 23,24,98 
Studies using a protein-dependent in vitro splicing system for the 
yeast mt group I intron aI5(3 showed that Mssll6 acting after 
binding of a structure-stabilizing splicing factor accelerates the 
second splicing step, exon ligation, for that intron. 99 



It has been appreciated for decades that RNAs are prone to 
misfolding, starting from pioneering studies with tRNA and 5S 
rRNA. 100,101 More recently, misfolding has been demonstrated for 
several group I introns (described further below), 102 " 107 the bacte- 
rial RNase P RNA, 108 and the hepatitis delta virus ribozyme, 109 
leading to recognition of misfolding as a dominant theme in RNA 
folding. 91,110 A notable exception was thought to be the Azoarcus 
group I intron ribozyme, which is roughly half the length of the 
Tetrahymena ribozyme and forms tertiary structure within milli- 
seconds. 111 However, recent studies of that intron revealed one or 
more intermediates that persist on longer time scales 112 and refold 
slowly to the native state at 25°C. 113 

The group I intron from the large subunit rRNA of 
Tetrahymena thermophila, first shown to be catalytic by Cech 
and colleagues," 4 has been used extensively in studies of RNA 
folding. Upon addition of Mg 2+ in vitro, it was shown to adopt 
misfolded conformations that included alternative base pairs 
within the 5' exon, 103,115 and subsequent work provided evidence 
for non-native structures within the intron as well." 6 The RNA 
chaperone activity of DEAD-box proteins was first demonstrated 
by showing that CYT-19 could accelerate splicing by resolving 
kinetic traps in variants of the Tetrahymena intron, and later work 
showed that this general chaperone function could be performed 
equally well by Dedl. 23,98 

To probe folding of the intron portion in greater detail, further 
studies have used "ribozyme" constructs, in which the exons are 
removed, leaving an RNA that acts as a true enzyme by binding 
and cleaving a substrate oligonucleotide in a reaction that mimics 
the first step of splicing." 7 In addition to simplifying the folding 
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process by eliminating effects of exons, 
such constructs offer the advantage that 
the fraction of native ribozyme can be 
determined in a robust and versatile way 
by measuring the fraction of substrate 
that is cleaved rapidly upon addition 



at various 



fold 



ing times. 



Chemical 



footprinting and catalytic activity 
approaches showed that the standard 
ribozyme version of the Tetrahymena 
intron folds through a series of interme- 
diates, 120,121 and most of the population 
folds to a long-lived misfolded interme- 
diate. 105,122 " 125 This misfolded intermedi- 
ate is extensively structured, including 
a large amount of native structure and 
refolds to the native state slowly, on the 
time scale of hours under typical in vitro 
conditions. 122,126 The refolding requires 
extensive transient unfolding, which 
is thought to be necessary to allow a 
topological change within the ribozyme 
core. 126 

Because the misfolded conformation 
of the Tetrahymena ribozyme is formed 
by a large fraction of the RNA and is 
sufficiently long-lived that a nearly 
homogeneous population can be gener- 
ated experimentally, its refolding reac- 
tion can be readily studied in greater 

depth. Building on earlier work, 23 catalytic activity measurements 
showed that CYT-19 accelerates refolding of the misfolded ribo- 
zyme to the native state and is not needed for subsequent catalytic 
activity. 83 Because the refolding reaction is rate-limited by the 
transient disruption of tertiary structure, 126 CYT-19 is inferred 
to accelerate the loss of tertiary structure. Consistent with this 
interpretation, the acceleration depends on ATP and is enhanced 
by mutations in the ribozyme that weaken tertiary structure. 83,127 

Further work showed that CYT-19 does not preferentially 
recognize specific structural features of the misfolded RNA and 
can also unfold the native ribozyme, but the unfolding activity 
is much less efficient for the native state because this structure is 
more stable. 127 These results suggest a general model for chaper- 
one activity of DEAD-box proteins in which cycles of non-spe- 
cific RNA unfolding and refolding can give additional chances 
for folding to the native state, which accumulates if it is more 
stable than the misfolded state (Fig. 5). 

Unfolding of the native Tetrahymena ribozyme has also been 
used as a quantitative assay to probe the mechanisms of RNA 
unfolding by DEAD-box proteins. It was shown that Mssl 16, like 
CYT-19, can unfold the native ribozyme. 128 A mutation of motif 
III of Mssl 16 (SAT->AAA), which weakens ATP-dependent 
RNA unwinding by Mssl 16, also decreases the efficiency of this 
unfolding reaction and by a similar amount. 128 For the wild- 
type protein, the rate constant for unfolding the ribozyme was 
at least 0.1 sec" 1 and did not reach saturation under the available 




Misfolded 



Figure 5. Model for general chaperone activity of DEAD-box proteins. The model is based on refold- 
ing of the Tetrahymena group I intron ribozyme by CYT-19. ATP-dependent unfolding of misfolded 
conformers allows the RNA additional chances to fold to the native state (green). Although the 
unfolding process mediated by CYT-19 is inherently non-specific, the native state of the RNA can be 
unfolded with much lower efficiency than misfolded states if it is more stable. The native and mis- 
folded structures of the Tetrahymena ribozyme are shown as identical to emphasize the extensive 
native structure in the misfolded form of this RNA and because the specific structural difference is 
not known. A plausible model for the unfolding process is shown, in which CYT-19 first binds to the 
ribozyme non-specifically by interacting with the C-tail, and then the helicase core engages and 
unwinds a duplex in an ATP-dependent reaction, which also requires or results in disruption of a 
tertiary contact. The sequestration and local unwinding of this helix then fosters additional losses 
of structure, which may propagate along the unwound helix or through space, as shown, by the loss 
of tertiary structure. Although this model is consistent with the known properties of CYT-19 and the 
refolding process of this RNA, the specific pathways and mechanisms of unfolding are not known. 



conditions. Interestingly, this lower limit is within 20-fold of the 
maximal flux through the ATPase cycle (see ATPase cycle kinetics 
above). 56 This connection to the ATPase measurements provides 
an important mechanistic insight because it most simply suggests 
that unfolding of the ribozyme requires no more than 20 ATPase 
cycles. These ATPase cycles are presumably used to disrupt RNA 
structural elements, allowing formation of new contacts that lead 
to formation of the native state. It remains to be determined what 
structural elements are disrupted, how many times they are dis- 
rupted on average before the native state is reached, and whether 
the unfolding activity consists solely of unwinding RNA heli- 
ces or whether RNA tertiary contacts are disrupted directly by 
DEAD-box proteins. 

Group II introns. Group II introns are a second major class of 
autocatalytic introns. 125,130 They splice themselves from precursor 
RNAs by a mechanism that involves the formation of an intron 
lariat RNA and is chemically identical to the splicing mecha- 
nism used by nuclear spliceosomal introns in higher organisms. 
The similarity in splicing mechanism and between the structures 
of group II intron domains and snRNAs suggest that group II 
introns were evolutionary ancestors of spliceosomal introns and 
the spliceosome. 129,131 Indeed, some group II introns are mobile 
genetic elements and encode reverse transcriptases that function 
in both intron mobility and RNA splicing ("maturase" activity), 
explaining how group II introns may have initially spread within 
nuclear genomes before evolving into spliceosomal introns. 129 
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Thus, the roles played by DEAD-box proteins in splicing group 

II introns may provide insight into their roles in promoting RNP 
structural transitions in the spliceosome. 6,88 

To catalyze splicing, group II introns fold into a conserved ter- 
tiary structure in which six modular helical domains (DI-DVI) 
interact to form the RNA active site. 132,133 DI is the largest domain 
and provides a scaffold for the assembly of other domains, includ- 
ing DV, which contains key nucleotide residues involved in 
catalysis, and DVI, which contains the branch-point adenosine. 
Like group I introns, the folding of group II introns is promoted 
by intramolecular RNA contacts and by proteins, which vary 
for different group II introns. 129 These proteins include intron- 
encoded reverse transcriptases with maturase activity (see above), 
host-encoded splicing factors that bind specifically to the intron 
RNA to stabilize the active RNA structure, and DEAD-box 
proteins. 22 ' 24 ' 129 ' 134 ' 135 

Yeast mtDNA encodes four group II introns in the genes 
encoding cytochrome b {COB) and subunit I of cytochrome oxi- 
dase (COX1), and all four of these introns require Mssll6 for 
efficient splicing in vivo. 22 ' 24 Two of these introns, all and aI2, 
encode reverse transcriptase /maturases that function as intron- 
specific splicing factors to stabilize the active intron structure. In 
the absence of Mssll6 in vivo, unspliced precursor RNAs con- 
taining aI2 accumulate in a complex with the intron-encoded 
maturase, indicating that Mssll6 is not required for maturase 
binding and may function on the assembled or partially assem- 
bled RNP. Because CYT-19, which was known to function as an 
RNA chaperone in group I intron splicing, could replace Mssll6 
to promote the splicing of all and aI2 in vivo, this later step was 
hypothesized to involve the resolution of kinetic traps. 24 

The remaining two yeast mtDNA group II introns, aI57 and 
bll, do not encode maturases. A key step in analyzing the splic- 
ing of these introns was the development of in vitro systems in 
which purified DEAD-box proteins promote their splicing under 
near physiological conditions in vitro. 21,136 ' 137 These in vitro sys- 
tems better recapitulate biologically-relevant folding processes 
than the previous non-physiological conditions used to study 
these introns, and they have given important insights into the 
roles of DEAD-box proteins in group II intron folding. Mssll6- 
promoted splicing of aI57 and bll in vitro is strictly dependent 
upon ATP and is completely inhibited by motif I mutations that 
ablate ATP binding or hydrolysis. 128,137,138 Reports that the motif 

III mutant of Mssll6 (SAT->AAA) could promote splicing of 
aI57 without unwinding RNA 137,139 were refuted by showing that 
this mutant retains ATP-dependent RNA-unwinding activity for 
short duplexes and that this residual RNA-unwinding activity 
correlates with splicing activity in vitro and in vivo. 128,138 Further, 
CYT-19 and Dedl can replace Mssll6 to promote the splic- 
ing of these introns in vitro and in vivo, indicating non-specific 
interactions, 21,98,136,137 and the ability of different DEAD-box 
proteins to promote splicing in vitro and in vivo also correlates 
with their RNA-unwinding activity. 98,138 Importantly, protease- 
digestion experiments showed that CYT-19 acts on bll as a 
classical RNA chaperone that promotes RNA folding in an ATP- 
dependent manner and is no longer required after RNA folding 
has occurred. 136 Although other DEAD-box protein activities 



may contribute to the splicing of some introns (see below), col- 
lectively, these findings indicate that Mssll6 functions in splic- 
ing yeast mt group II introns as it does in splicing group I introns 
primarily by using its ATP-dependent RNA unwinding activity 
to function as a non-specific RNA chaperone. 

The in vitro splicing systems have also given information on 
the kinetically-trapped structures by demonstrating the involve- 
ment of exon sequences. Early studies showed that the self-splic- 
ing rate of aI57 is strongly decreased by inhibitory sequences in 
the 5' exon, as found for group I introns (see above). 140 More 
recent studies of Mssll6-promoted splicing of aI57 in vitro have 
indicated that long exons do indeed contribute to kinetic traps, 
either by misfolding or by stabilizing inactive structures within 
the introns. 138,139 Exon sequences are likely to make a substan- 
tial contribution to kinetic traps in vivo, where COX1 precursor 
RNAs have long and heterogenous 5' exons, reflecting different 
orders in which introns upstream of aI57 are spliced. 138 

To probe further how DEAD-box proteins promote fold- 
ing steps within the intron, further studies have used ribozyme 
constructs, analogous to those of group I introns, or constructs 
with very short exon sequences that minimize the potential for 
exons to contribute to kinetic traps. Early work indicated that 
under non-physiological conditions of high ionic strength and 
elevated temperature, a construct that includes domains I, HI and 
V (denoted D135 RNA) folds to the native state on the minutes 
time scale without being rate-limited by kinetic traps. 141,142 On 
the other hand, folding of this construct under near-physiologi- 
cal conditions is much slower and more complex, with multiple 
pathways and folding times extending to hours. 138,143 An impor- 
tant advance in our understanding of how Mssll6 promotes 
folding of aI57 under these latter conditions came from a recent 
study in which a single molecule fluorescence approach was used 
to measure folding of a dual-labeled D135 ribozyme. 144 These 
experiments revealed that Mssll6-promoted folding involves two 
discrete transitions that were detected by FRET between the two 
dyes: first there is an ATP-independent step, which most likely 
includes compaction of DI, 143,145,146 and then a second step that is 
ATP dependent. The authors favored the interpretation that this 
second step involves dissociation of bound Mssll6 to permit fur- 
ther folding but left open the possibility of ATP-dependent RNA 
unwinding to resolve a kinetic trap. In a separate study, a two- 
stage assay that separates RNA folding from catalysis showed that 
maximal stimulation of folding requires ATP and that even when 
Mssll6 is removed artificially by proteolysis, ATP-independent 
folding to the native state occurs only for a subpopulation of 
the D135 ribozyme, indicating multiple folding pathways. 138 
Analogous experiments using a construct that included short 
exons indicated that removal of Mssll6 by proteolysis does not 
result in detectable self-splicing, 138 suggesting that even short 
exons increase the probability of kinetic traps late in folding, 
either because they can participate in misfolding or because they 
promote and/or stabilize misfolding within the intron. 

To probe in more detail the folding transitions in the presence 
and absence of Mssll6, a recent study used nucleotide analog 
interference mapping (NAIM) with a native gel separation of 
compact and non-compact RNA, using constructs of aI57 that 
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include short exons. 147 The central findings were that a substan- 
tial number of modifications blocked or slowed compaction, with 
some of the effects larger in the presence of Mssl 16, and that most 
of the interferences resulted from modifications that would be 
expected to weaken base pairs. The authors interpreted the iden- 
tities of the interfering nucleotides to indicate that the formation 
of base pairs is rate-limiting for folding, and they interpreted the 
larger interferences in the presence of Mssl 16 to indicate a greater 
importance of base-pair-forming steps in Mssll6-mediated fold- 
ing. Thus, they suggest that Mssl 16 promotes compaction of Dl 
by structural stabilization of an obligate intermediate and does 
not act as a helicase to unwind misfolded structures within the 
intron. Because these experiments monitored only the first tran- 
sition - compaction - conclusions about other steps in intron 
folding are not warranted. In addition, the experiments in the 
absence of Mssl 16 monitored only the 15—20% of the ribozyme 
population that compacted within 45 min, and not the remain- 
ing 80-85%, the majority of which folds along other pathways 
involving additional steps that may be accelerated by Mssl 16. 

Considered together, the simplest model that emerges from 
the data for aI57 splicing is that Mssl 16 is needed to resolve 
kinetic traps involving the exon sequences and additionally facili- 
tates folding steps within the intron. Mssl 16 promotes a major 
early intron folding step, compaction of Dl, without requir- 
ing ATP. Upon dissociation of Mssl 16, one or more additional 
intermediates are encountered by most of the population for the 
D135 ribozyme and essentially the entire population for long or 
short exon-containing constructs. The intermediate (s) most likely 
includes non-native structures, which may include or be stabilized 
by exons when present, and Mssl 16 functions by using ATP- 
dependent RNA unwinding activity to promote partial unfold- 
ing and permit folding to the native state. Further work is needed 
to refine knowledge of the molecular events associated with the 
multiple folding pathways through this compaction step and the 
acceleration by Mssl 16 along these different pathways, as well as 
to probe the ATP-dependent steps that follow compaction and 
are required for folding of most of the intron population to the 
native state. 

A final point is that although DEAD-box proteins dramati- 
cally accelerate splicing of the aI57 and bll introns, the in vitro 
reactions are still relatively slow (2-4 x 10" 2 min" 1 ), suggesting that 

References ^' 

1. Fairman- Williams ME, Guenther UP, Jankowsky E. 
SF1 and SF2 helicases: family matters. Curr Opin 
Struct Biol 2010; 20:313-24; PMID:20456941; http:// j 
dx.doi.org/10.1016/j.sbi.2010.03.011. 

2. Jankowsky E. RNA helicases at work: binding and 
rearranging. Trends Biochem Sci 2011; 36:19-29; 
PMID:20813532; http://dx.doi.Org/10.1016/j. g 
tibs.2010.07.008. 

3. Linder P, Jankowsky E. From unwinding to clamping 
- the DEAD box RNA helicase family. Nat Rev Mol 
Cell Biol 2011; 12:505-16; PMID:21779027; http:// 9 
dx.doi.org/10.1038/nrm3154. 

4. Noller HE RNA structure: reading the ribosome. 
Science 2005; 309:1508-14; PMID: 16141058; http:// 
dx.doi.org/10.1 126/science.l 1 1 1771. 

5. Carthew RW, Sontheimer EJ. Origins and Mechanisms \Q 
of miRNAs and siRNAs. Cell 2009; 136:642-55; 
PMID:19239886; http://dx.doi.0rg/lO.lOl6/j. 
cell.2009.01.035. 



additional proteins might contribute in vivo and be necessary to 
reconstitute the complete physiological folding pathway for these 
introns in vitro. 

Recent studies have shown that DEAD-box proteins also func- 
tion in splicing of plant mt and chloroplast group II introns. 134 ' 135 
The splicing of these plant group II introns involve large RNP 
complexes with multiple proteins, more analogous to the spliceo- 
some. The roles of DEAD-box proteins in these more complex 
systems remain to be elucidated. 

Future Prospects 

Although the progress in understanding the properties and roles of 
DEAD-box proteins has been substantial, many exciting questions 
remain unanswered. In terms of the biological roles, the full set of 
interactions with cellular RNAs is just beginning to be explored. 
Although DEAD-box proteins such as Mssl 16, CYT-19 and oth- 
ers like Dedl and eIF4A do not have high specificity for indi- 
vidual RNA targets, it remains possible that they have preferred 
physiological targets. Conversely, DEAD-box proteins that are 
targeted to specific RNAs could still interact less specifically with 
other cellular RNAs. Recent studies indicate that the interactions 
of DEAD-box proteins and their effects on RNA structure begin 
during transcription. Mssl 16 has been found to interact with the 
mt RNA polymerase in vivo, positioning it to influence the fold- 
ing of nascent RNAs. 148 The yeast protein Dbp2, the ortholog of 
human p68, was shown to associate with chromatin and prevent 
accumulation of cryptic, non-functional transcripts. 149 This activ- 
ity requires the ability to bind and hydrolyze ATP, and together 
with known properties of DEAD-box proteins suggests that Dbp2 
functions as a chaperone, promoting transient unfolding and rear- 
rangements of these cryptic transcripts to permit their degrada- 
tion. These and other results suggest that the roles of DEAD-box 
proteins as chaperones of RNA folding may be very broad, and 
they highlight the importance of further research toward a com- 
plete molecular understanding of these remarkable proteins. 
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